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ABSTRACT 

We discuss how the redshift dependence of the observed two-point correlation 
function of various classes of objects can be related to theoretical predictions. 
This relation involves first a calculation of the redshift evolution of the under- 
lying matter correlations. The next step is to relate fluctuations in mass to 
those of any particular class of cosmic objects; in general terms, this means 
a model for the bias and how it evolves with cosmic epoch. Only after these 
two effects have been quantified can one perform an appropriate convolution 
of the non-linearly evolved two-point correlation function of the objects with 
their redshift distribution to obtain the 'observed' correlation function for a 
given sample. This convolution in itself tends to mask the effect of evolution 
by mixing amplitudes at different redshifts. We develop a formalism which 
incorporates these requirements and, in particular, a set of plausible models 
for the evolution of the bias factor. We apply this formalism to the spatial, 
angular and projected correlation functions from different samples of high- 
redshift objects, assuming a simple phcnomcnological model for the initial 
power-spectrum and an Einstein-de Sitter cosmological model. We find that 
our model is roughly consistent with data on the evolution of QSO and galaxy 
clustering, but only if the effective degree of biasing is small. We discuss the 
differences between our analysis and other theoretical studies of clustering evo- 
lution and argue that the dominant barrier to making definitive predictions 
is uncertainty about the appropriate form of the bias and its evolution with 
cosmic epoch. 

Key words: cosmology: theory - cosmology: observations - large-scale struc- 
ture of Universe - galaxies: formation - galaxies: evolution - galaxies: haloes 



1 INTRODUCTION 

The analysis of the clustering properties of various classes of objects at high redshift is rapidly becoming one of 
the key tests of models for the formation and present properties of the large-scale structure of the universe. The 
importance of such studies is that they offer the prospect of highlighting differences between viable theoretical 
models which, by construction, display clustering behaviour which is very similar at the present epoch (e.g. Coles 
1996). Improvements in observational technology also offer the prospect of providing strong constraints on the nature 
of any bias that exists between the clustering of objects of a particular type and that of the underlying matter 
distribution. The possible existence of a significant bias in the galaxy distribution is a presently a significant obstacle, 
for example, to the determination of the cosmological density parameter flo from large-scale clustering data (e.g. 
Coles & Ellis 1996). This second point is particularly relevant because of the wide range of different objects which are 
now accessible to systematic redshift survey programmes - not only the 'traditional' bright, optically selected galaxies 
upon which surveys were based in the 1980s, but now also faint optical galaxies, galaxies selected in the infrared, 
QSOs, absorption-line systems, radio-galaxies and clusters of galaxies selected either optically or through their X-ray 
emission. Each of these classes of objects might be related to the matter distribution in a different way, a possibility 
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that, on the one hand, poses great difficulties of interpretation but, on the other, admits the possibility of reafising 
many independent tests of theories for the formation of structure. 

In spite of the many important recent observational developments, theoretical models used to interpret the 
data have so far generally been naive and poorly motivated in terms of galaxy formation scenarios. This theoretical 
inadequacy is clearly shown by the wide use, primarily in the literature relating to observational data, of the model 

e(r,z)=C(r/(l + z),0)(l + z)-(^+^) (1) 

for the redshift evolution of the two-point correlation function ^(r, z) at the comoving separation r, where e is an 
arbitrary fitting parameter. If the space dependence of the two-point function can be fitted by a power law, the above 
relation simplifies to 

C(r,z) = (r/rJ-^(l + z)-(3-^+^' , (2) 

where is a constant measuring the unity crossing of ^ at z; = 0. 

Recent observational studies have served to highlight the importance of understanding the validity (or otherwise) 
of the simple models (1) & (2). For example, using equation (|^) as a fitting formula for the two-point function of faint 
galaxies, various authors (e.g. Efstathiou et al. 1991; Neuschaefer, Windhorst & Dressier 1991; Couch, Jurcevich & 
Boyle 1993; Roche et al. 1993; Bernstein et al. 1994; Cole et al. 1994; Brainerd, Small & Mould 1995; Le Fevre et al. 
1996; Roche et al. 1996; Shepherd et al. 1996) have found observational evidence for an excess evolution with respect 
to the stable clustering prediction (e = 0), i.e. for positive values of e (the so-called collapsing models). In particular. 
Shepherd et al. (1996) and Le Fevre et al. (1996), from the analysis of galaxy clustering at moderate redshifts, 
obtained e ~ 1 ± 1. But is this a real (physical) discrepancy with realistic models, or would more sophisticated 
modelling of the evolution of ^{r, z) yield a different interpretation? We incline to the second view, for reasons we 
explore below, and suggest that the quantity and quality of the high-redshift data now available requires a significant 
improvement in the mathematical models deployed for their interpretation. 

In some ways this situation is reminiscent of the way some pre-COBE analyses of the CMB angular two- 
point function were carried out: in most experimental papers the data were fitted in terms of the so-called 'Gaussian 
correlation function', which had no justification in terms of theoretical models for the temperature anisotropy pattern. 
Likewise these simple power law models for the correlation function have relatively little theoretical motivation and 
a more realistic approach is consequently required. 

Assuming that clustering grows by gravitational instability, the above formula is only justified in two cases: for 
e = it reproduces the prediction of the so-called stable clustering model, while, for e = n + 2 = 7— 1, it results from 
the application of linear theory in an Einstein-de Sitter universe to purely scale-free power-spectra, Piin(fc, 0) oc k" . 
The case where e = 7 — 3 corresponds to a clustering pattern that simply expands with the background cosmology as 
if the galaxies were just painted on a homogeneous background. Concerning the first case, one should remember that 
the idea underlying the stable clustering ansatz is that, on sufficiently small scales, gravity acts to stabilize the number 
of neighbours of an object in a proper volume, after this has turned around from the universal expansion. Although 
the physical grounds of this model appear to be reasonably sound (e.g. Jain & Bertschinger 1996 and references 
therein; see, however, Padmanabhan et al. 1996), numerical simulations indicate that this type of dynamics is only 
relevant on scales where the mass autocorrelation function is at least as large as ~ 100 (e.g. Efstathiou et al. 1988), 
i.e. at most a small fraction of the typical scales probed by catalogues. The second model is also inadequate to treat 
the behaviour of correlation functions in any realistic galaxy formation scenario. 

Melott (1992) has analysed the growth of clustering in numerical simulations for an ensemble of scale-free models. 
He finds that the lower is the value of the spectral index n, the larger is the value of the parameter a = 3 — 7 + e 
and that positive values of e are easily allowed for in all models with n < 1. Melott 's explanation for such a fast 
clustering growth is as follows: stable clustering is not an upper limit to the growth of correlations; whenever the initial 
conditions contain non-vanishing large-scale power, merging makes new clusters form, their central density increases 
with time, which in turn enhances the growth of correlations. Using N-body simulations in the framework of the cold 
dark matter (CDM) model, Efstathiou (1995) has shown that faint blue galaxies can have lower correlation amplitude 
than normal luminous galaxies observed at the present day, as displayed by the observed value of e, provided that such 
faint galaxies are a transient population associated with dark matter haloes of low mass (less than ~ 10^"^ Mq) and 
rotation speed. A numerical study of the evolution of the two-point function both for the matter and halo population 
has been recently carried out by Coh'n, Carlberg & Couchman (1996); they obtain a scale-dependent e parameter 
which is about 1 for mass particles in an Einstein-de Sitter universe, and lower for low-density models. A broad 
range of values (ranging from —0.2 to 1 in the fiat case and reaching lower values in the open case) is obtained for 
haloes, depending on their mean density (see also Brainerd & Villumsen 1994). 

Recently, Jain (1996) has discussed the reliability of the general relation of equation (^ in the context of various 
models. His conclusions are that the above parameterization for the evolution of clustering is inaccurate in CDM-like 
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models, for two reasons. First, because the growth of S.ir,z) with time on intermediate scales is much faster than the 
(1 + z)~^ law prescribed by stable clustering at fixed proper separation (see also Peacock & Dodds 1996), second, 
because the boundary between the linear, mildly non-linear and stable clustering regimes, occurs at scales which 
rapidly change with time. 

A major advance in this field was represented by the work of Hamilton et al. (1991), which first provided a sort 

of somi-analytic model (sec also Nityananda & Padmanabhan 1994), able to interpolate between the very small- 
scale behaviour, accurately described by stable clustering, and the very large-scale (and/or early-time) one, which 
is expected to follow the simple prescriptions of linear perturbation theory. Of course, there also exist some more 
physically motivated models for non-linear gravitational clustering, such as the Zcl'dovich approximation (Zel'dovich 
1970) or alternative algorithms (e.g. Sahni & Coles 1995 for a recent review), like the adhesion (Gurbatov, Saichev 
& Shandarin 1989), frozen-flow (Matarrese et al. 1992) and frozen-potential (Brainerd, Scherrer & Villumsen 1993; 
Bagla & Padmanabhan 1994) approximations, which allow one to follow the time-evolution of the clustering of 
coUisionless matter on a wide range of scales and epochs. However, in contrast to the Hamilton et al. (1991) ansatz 
and later modifications of it (Peacock & Dodds 1994; Jain, Mo & White 1995; Peacock & Dodds 1996), none of these 
alternative algorithms exactly reproduces the correct very small-scale (i.e. strongly non-linear) behaviour. 

An approach similar to the one by Hamilton et al. (1991) has been applied by Peacock & Dodds (1994, 1996) in 
Fourier space to evolve the dimensionless power-spectrum (actually the contribution to the variance per unit In k) 
into the non-linear regime. These types of approaches have been tested against numerical simulations in the linear 
and mildly non-linear regimes by Baugh & Gaztanaga (1996). Padmanabhan (1996), Bagla & Padmanabhan (1996), 
Padmanabhan et al. (1996) and Munshi & Padmanabhan (1996) have investigated the more strongly non-linear 
regime, the latter authors in particular questioning the applicability of the stable clustering limit. Sheth & ,]ain 
(1996) suggest that the limit of stable clustering on small scales is only expected to ensue on scales where the density 
contrast exceeds ~ 300 — 600. This issue remains controversial. 

Besides the problems connected to the non-linear evolution of the clustering of the dark matter, one also has to 
face further non-trivial problems, such as the correct definition of the linear bias factor relating - in the simplest pos- 
sible case - the object number-density fluctuations to the mass-density fluctuation field, and its redshift dependence. 
Although the idea that cosmic objects of different types might be biased is ubiquitous in modern cosmology, the 
models that have so far been constructed are relatively crude. This question is therefore one of the major stumbling 
blocks to further progress in the analysis of cosmological structure formation. The problem is particularly crucial 
here, for do we need to know not only the form of the bias at the present epoch but also its redshift -dependence. 

One also has to take care to consider exactly how the 'intrinsic' redshift-dependent correlation function of any 
given class of objects is related to the 'observed' one, i.e. to the statistic one is actually able to extract from the data, 
which usually involves convolving with the redshift distribution of objects in the catalogue, accounting for the mask, 
correcting for redshift-space distortions and for the magnification bias induced by weak gravitational lensing, and 
finally adopting an appropriate choice for the redshift binning. 

The main point wc arc making is that one needs to be very careful indeed about drawing conclusions on the 
behaviour of the intrinsic correlation function of the given objects and on the scenario of galaxy formation from the 
observed clustering of high-redshift objects, because of intervention of these difficulties. Constructing a theoretical 
framework for modelling these effects is the main task of this work. The plan of the paper is as follows. In Section 
2 we define the 'observed' spatial, angular and projected two-point correlation function for high-redshift objects 
and discuss its connection with the underlying mass autocorrelation function. In Section 3 we present a theoretical 
model for the redshift evolution of the correlations of the underlying matter distribution. In Section 4 we introduce a 
generalized model for the bias of a class of objects and how the bias might evolve with epoch, discussing some special 
cases of this model in some detail. In Section 5 we apply our formalism to the clustering of QSOs and galaxies at 
high redshift. Conclusions are drawn in Section 6, where we also discuss the relationship of this work to alternative 
analyses of clustering evolution. 



2 DESCRIPTION OF CLUSTERING 

Let riobs(2.7; z, M) be the number-density of objects with redshift z that an observer placed in the origin measures in 
the angular direction specified by the unit vector 7. Here x = x{z) is the comoving radial coordinate corresponding 
to the redshift z, which, in a matter dominated universe is given by Mattig's formula (e.g. Peebles 1980) 

, . ^ 2c naz + (no-2)[-l + {noz + lY'^\ ^ 2cA{z) 

aoHo ^1(1 + z) ~aoHonl(l + z)' ^' 

Qo being the density parameter and Ho the present value of Bubble's constant. Note that, while in the Einstein-de 
Sitter case oo is an arbitrary length-scale which can be set to unity, in the non-fiat case it is given by 
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ao = ^\l-no\-'^\ (4) 

In order to convert dimensionless comoving coordinates to physical comoving ones, i.e. expressed in Mpc, one should 
always multiply by ao; so, for instance, the comoving radial distance of an object with radial coordinate x{z) is 
r{z) = aox{z). The inverse relation of (3) is also useful: 

-, , _^ 2y(l-no) + no + (2-no)[l + ;/^(l-no)]^/^ Hoaox 

In adopting the above relations we are implicitly assuming that redshift distortions are negligible, which is a good 
approximation in dealing with high redshift objects. The object number-density will generally also depend on other 
quantities such as mass, luminosity or absolute magnitude in a given wave band, etc., which we generally indicated by 
the variable M; nobs(a;7; z, M) will be taken to represent the observed number-density of objects per unit logarithmic 
interval of M. In the particular models discussed in Sec. 4, however, we shall make the assumption that relevant 
physical parameters of the object are tightly correlated with the halo mass and, in that context, we shall simply take 
M to mean the mass of the parent halo. 

The probability per unit solid angle to observe an object of type M with redshift in the interval z,z + dz is 

/ ^^nobB{x-y;z,M)g{z)dz, (6) 

where 



g{z) = al{l + z)-^ 



i-^^^(ao-i) 



2 ax 
dz 

-1/2 



_no(l+£L_ 1/2 



(7) 



The joint probability per squared unit solid angle to observe two objects of type Mi and M2 at redshifts zi and Z2 
in directions separated by an angle iD is 



dn-yi di}j2 . _ cos'ff^nohsixi^i; zi, Mi)nobs{x2^2; Z2, M2)g{zi)g{z2)dzidz2, (8) 



L 277 

where 5^ stands for the Dirac delta function. 

In practice one calculates different quantities, namely the mean number of objects per unit solid angle with 
redshift in some range Z and M in a certain domain M, 



(9) 



Nn = [ d\nM' [ dz'g{z') [ ^UoU^'r, z' , M') 

and the mean number of pairs at fixed comoving separation r in the same redshift interval and M domain, 
Npair,{r)oc / dlnMidlnM2 / dzidz2g{zi)g{z2) / ^^^^^^<5^(7i • 72 - cosi?(a;i,a;2,r)) (10) 

nobs(a;'i7i; zi, Mi)nohs{x2^2; Z2, M2) , 

where '&{xi,X2,r) is the angle between two sides of a triangle whose radial coordinates are xi and X2, given that the 
third side has size r = aoXi2, as measured from the point with coordinate xi. In a flat geometry, 
2,2 2 

Q/ \ -|- a;2 — a;i2 , . 

cosi?(xi,X2,r) = , (11) 

2x1x2 

while in the most general non-flat universe case the relation must be obtained by inverting the expressions reported 
below. The observed number of objects will generally differ from the real one by the catalogue selection function 
4>{z,M), which, for simplicity, we assume to be isotropic, Mobs (2:7; M) = (f>{z, M)n{x^; z, M), where n{x^; z, M) 
is the 'real' number density of objects. Once the selection function is known, the mean number of pairs in the 
sample can be compared with different realizations of a theoretical model, corresponding to different observers. In 
practice, however, it is easier to compare the observed number of pairs directly with the ensemble average of the same 
quantity in the theory. Angular brackets will denote ensemble averages (or averages over different spatial locations 
of the observer). The ensemble averaged mean number-density of objects of type M and redshift z will then be 
nobs(«, M) = <f>{z, M){n{z, M)). The two averages commute, but the ensemble averaged number-density is direction 
independent, so the further angular mean becomes unnecessary, ri.obs(2, M) = 4>{z, M)(n(x; z, M)) = (^(«, M)n{z, M). 
Similarly, the ensemble averaged mean number of pairs with comoving separation r, (iVpairs(f')), depends on the cross- 
correlation 0(«i, Mi)(/)(z2, Af2)(n(a;i7i; «i, Mi)n(a;272; 22, M2)): from the statistical homogeneity and isotropy of the 
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random field n(x;z, M), it follows that the quantity in angular brackets is unafi'ected by quasi-translations (e.g. 
Weinberg 1972, pg. 413) and rotations, so that its only spatial dependence can be on the separation R12 of the point 
(2^2,72) as measured from the point (2:1,71) (e.g. Osmer 1981; Shanks & Boyle 1994^, namely 

-R12 = ao{D^xl + xl - 2Da;iX27i ■ 72}^''^ , (12) 
where 



D=[l- yU^o - 1)] + - 1 - (1 - yU^o - 1)) 



X2 



.1/2 



7i ■ 72 , (13) 



where yi = H^aoXi/c. Equation ( p^ reduces to the usual cosine rule, i?i2 = ao^J x\ + X2 — 2a;ia;27i • 72 in the 
Einstein-de Sitter case. Notice that the formula (12) for R\2 is not symmetric in x\ and X2- This is, however, only 
important if D differs significantly from unity which effectively means that one is dealing with distances of order the 
curvature radius. Since in any case we expect correlations from objects at such large separations to be very weak 
(Stebbins & Caldwell 1995) it is possible to ignore this effect for reasonable cosmological models. One can then replace 
the above cross-correlation into the expression for (iVpairs(r)) and perform the angular integrations; the effect of the 
Dirac delta function will then be that of fixing i?i2 = r. By normalising to the expected number of Poisson pairs in 
the same sample, one obtains (an ensemble average estimate for) the correlation function actually measured in the 
sample, namely 

^ohs{r)=N'^ dzidz2 dlnMidlnAf2 Ar(zi,Mi) A/'(z2,M2)((5„(xi;«i,Mi)5„(x2;22,M2)) , (14) 

J Z J M 

where (5„(x;z,M) = [n(x; z, M) — n{z, M)]/n{z, M), while X2 is any point whose separation from xi is r. We also 
introduced the quantity N{z,M) = ATig{z)(f){z, M)n{z, M), representing the number of objects in the sample with 
intrinsic property M in the range (InM, InM + d\nM) and redshift in the range {z, z + dz), and its integrals 
JV{z) = J^dlnM' M{z,M') and TV = J^dz'Af{z'), with obvious physical interpretation. Given our assumptions, 
negligible redshift distortions and an isotropic selection function, the latter formula is exact and general. It shows that, 
in principle, one should know the cross-correlation of the random field n(x; z, M) at two different z and M . In order 
to proceed, however, we need to make some approximations. First of all we will assume separation of property M and 
position in the form of a linear biasing factor relating the object number-density fluctuation 5„ to the mass-density 
fluctuation 5m, namely (5„(x;2,M) ~ 6(A/, z)5m(x; z), which is a reasonable approximation as long as a calculation 
of the two-point function is concerned. In such a case we flnd 

iohsir) = iV"^ j dzidz2 J^{zi) U{Z2) 6eff (^i) fecff(z2)(<5m(xi; zi)5,„(x2; Z2)) , (15) 

where, again, X2 is any point whose distance from xi is r, and we introduced the effective bias factor 

1 /■ , , rd\nM'b(M,z)(j){z,M')n(z,M') 

K«{z)^N{zr' dlnW b{M,z) U{z,M') ; ' / 7 ' I . (16) 

Jm Jj^dlnM' (j>{z,M') n{z,M') 

In fact, in a wide class of biasing scenarios one expects the bias parameter to be a monotonically decreasing function 
of scale (Coles 1993), so that the shape of the correlation function of objects at small scales is expected to be steeper 
than that of the mass. However, for simplicity we shall neglect this point in the subsequent analysis and assume that 
the bias can be described by a constant multiplier of the matter correlation function. We discuss the importance of 
bias in the study of clustering evolution at length in Section 4. 

Next, we make the approximation that the dominant contribution to the integral in equation ( p^ comes from 
points whose redshifts are nearly the same. As far as the integral is concerned, we can then make the replacement 
((5m(xi; 2i)(5r„(x2; 22)) — > ^(r, Zavc), whcre ^(r, Zavo) is the mass autocorrelation function at some intermediate red- 
shift 2ave which we can take as Zave = z = {zi + Z2)/2^ A rough criterion for the validity of this approximation is 
\(," {r, z)/^{r, z)\Az^ ^ 1, having indicated by a prime the differentiation with respect to 2, where Az is the width of 
the given redshift range; this point is discussed in more detail by Porciani (1996, in preparation). At this level the 
possible spatial curvature of the universe can be assumed to affect £,{r,z) only through its time-evolution, since the 
comoving separation of any correlated pairs at the same redshift is certainly much smaller than the curvature length. 
We can then write 



* The formulae given by Shanks & Boyle (1994) contain two errors: first, their comoving radial coordinates are incorrectly 
normalised, which leads to wrong determinations of the object separations for any Qq 7^ 0, 1; second, D is incorrectly defined 
as the square root of the true one. 

^ Estimating 2ave through z{x) instead of z would imply a negligible correction in the final integral (actually less than a few 
percent for separations larger than one Mpc, in the model considered below). 
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^{r,^) = 7^ I dkk^P{k,z)jo{kr) , (17) 







with P{k, z) the mass fluctuation power-spectrum at z. The symbol ji will be generally used to denote the spherical 
Bessel functions of order i. We then arrive to the simple expression 

Cobs(r) = N-^ / dz^dz2 N(z^) N{z2) b,a{zi) b,s{z2) ^{r,z) . (18) 



J z 

The observed correlation function in a given redshift interval is a suitable weighting of the mass autocorrelation 
function with the mean number of objects and effective bias factor in that range. Only in the limiting case where 
the objects belong to a narrow redshift interval one is allowed to approximate ^obs by the linear relation £,obB{r, z) ~ 
b'^ff{z)(,{r,z), where z is the median redshift of the sample. 

Using similar reasoning we can write a simple expression for the observed angular correlation function in a 
certain redshift interval and M domain, in terms of the mass autocorrelation function. The result is an extension 
of the relativistic Limber's formula (e.g. Peebles 1980, Sect. 56), accounting for a linear bias factor in the relation 
between 5„ and Sm- One gets 

UJobsi^) = N~'^ J d2ldz2 A/'(2l) A/'(22) 6cff (21) 6cff(22)C(r-12, Zavo) , (19) 



where, neglecting the curvature corrections, ri2 = 001/2:^(21) + 2:^(22) — 2x{zi)x{z2) cosi?, with x{z) given by Mat- 
tig's formula, equation (^. Adopting as usual the small-angle approximation, one can easily obtain the handier 
form 



UohsW ^ dz G{z) J\f\z) bis{z) du^{r{u,^,z),z) , (20) 

J 2 —00 

where r{u,'d,z) = aoy^'u'^ + x'^{z)'d^, with x{z) given once again by equation and 



^0(1 + ^) fa , , ^^l/2 
J7o2 + l)V2-("«^ + l) 



(21) 



Here the approximation of replacing the two redshift dependence of the mass autocorrelation function by a single 
intermediate redshift 2avo corresponds to the standard procedure (e.g. Peebles 1980, p. 215). 

Another useful quantity to consider is the projected real-space correlation function for objects in the redshift 
band Z. This statistic can be easily obtained by applying directly the Davis & Peebles (1983) technique to our Cobs('') 
above. Defining rp as the separation of a pair perpendicular to the line of sight, rp = (ri + 7-2) tan(j?i2/2), with ri 
and r2 the comoving radial coordinates of the members and i9i2 their angular separation, one simply has 



dy Us{\/rp + y') = 2 / dr r (r' - r^)"'/" eobs(r) . (22) 



Wohs[rp) = 

In deriving this formula we have neglected peculiar motions, which, however, mostly affect separations along the line 
of sight, and assumed Euclidean geometry, which is a reasonable approximation as long as the underlying cross- 
correlation function is evaluated at a common mean redshift, as in equation (p^. 

As recently pointed out by Villumsen (1995), if the redshift distribution of faint galaxies is estimated by ap- 
plying an apparent magnitude selection criterion, then there is a possibility that a magnification bias due to weak 
gravitational lensing (Turner 1980) would intervene in the relation between the intrinsic spatial correlation function 
and the observed angular one. A similar problem could also affect the observed spatial correlation function of high 
redshift objects. This effect has not been taken into account in this work but we note that it would tend to increase 
the observed correlation function of high-redshift objects by adding a source of extra apparent correlations over and 
above that produced by intrinsic spatial correlations of the objects themselves. 



3 MODELLING THE EVOLUTION OF MATTER CORRELATIONS 

In the light of the preceding discussion it is clear that a theory capable of predicting the observed correlation function 
of some class of objects must first incorporate a model for the initial linear autocorrelation function of the primordial 
density fiuctuations and also a model describing how these correlations evolve with time. Only after this has been 
constructed does it make sense to worry about how a given class of objects relates to the mass distribution. We 
defer discussion of this second task to section 4; in this section we concentrate exclusively on the evolution of matter 
correlations. 
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As an illustrative example to which we shall apply our method, we assume the phenomenological linear power- 
spectrum obtained by Peacock & Dodds (1996) by fitting correlation results coming from different samples of galaxies 
and clusters. In their analysis they consider a linear power-spectrum of the form 

Pnn{k,0) = Pok"T\k) , (23) 
with the CDM transfer function, as given by Bardeen et al. (1986), 

T{k) = ^"(^+3^^^^^^) [1 + 3.89g + (16.1g)^ + (5.46g)^ + (6.71g)^] , (24) 

and q = {k/ hMpc~^) /F cB , where h, as usual, is the Hubble constant in units of 100 km s~^ Mpc~^. Peacock & Dodds 
(1996) find the following good fit for the effective shape parameter Fcff = 0.255 ± 0.017 + 0.32(l/n — 1) originally 
introduced by Efstathiou, Bond & White (1992). In CDM models Fcff is related to the density parameter Qq and to 
the baryonic fraction Q,b by the relation (Sugiyama 1995) Fcff ~ f2o/i exp[— f26(l + l/ilo)]- White et al. (1996) have 
recently discussed critical-density CDM models with high baryon content, as suggested by recent observations, which 
would help to reduce Fcff below the 'standard' value of 0.5. In the following analysis, however, we take Fch = 0.25 as 
an empirical parameter, without any relation to a true CDM model, assuming flo = 1. 

The normalisation is then fixed consistently with the 4-year COBE data which give QrmsPS = 18 ± 1.6 /iK 
(Bennett et al. 1996; Gorski et al. 1996), for n — 1, which we assume here, and To — 2.728 ± 0.004 K (Fixsen et al. 
1996). This leads to Po ~ 6.74 x 10^ ih~^ Mpc)^ and (Tg = criin(8 /i~^Mpc) = 0.65, in reasonably good agreement with 
the Peacock & Dodds (1994) amplitude. A more recent analysis of the COBE 4-year data by Bunn & White (1996), 
who used a Karhunen-Loeve expansion to produce an unbiased estimate of the normalisation, implies an increase in 
the amplitude of the power-spectrum of order 16 per cent for a standard CDM model; but changing the normalisation 
of our phenomenological model in this way will not alter the conclusions of this paper. 

The question of how the matter correlations evolve into the non-linear regime has been the subject of considerable 
recent research activity. A large fraction of the literature on the clustering of high-redshift objects uses either the self- 
similarity relation for stable clustering or ad hoc generalizations of it, as in equations (|l|) and (^), which also assume 
separation of the spatial (in proper coordinates) and redshift dependence (e.g. Peebles 1980). The self-similarity 
relation, obtained from equation (^) with e = (e.g. Peebles 1980), is expected to be the asymptotic state resulting 
from the evolution of scale-free initial conditions, Piin(fc,0) oc fc" (—3 < n < 4), in an Einstein-de Sitter universe, 
and leading to 7 = 3(3 + n)/(5 + n). For more general initial power-spectra, the first relation, with e = 0, should 
still be valid in the stable clustering regime. However, it is well known that stable clustering describes the evolution 
of the mass autocorrelation function only in the strongly non-linear limit, ^ ^ 100 (e.g. Efstathiou et al. 1988), 
under the hypothesis that the peculiar relative velocity of the pair exactly balances the Hubble fiow to form a bound 
configuration. This model only applies on very small scales (e.g. Efstathiou et al. 1988). On the other extreme, one 
has the linear theory prediction 

^nn{T,z)^Dl{z)i{r,Q) , (25) 

where D+{z) is the growing mode of linear perturbations [D+{z) = (1 + jz)"^ in an Einstein-de Sitter model], which 
only applies to large scales and/or at early times, i.e. when ^ ^ 1. Scale-free initial conditions linearly evolved in an 
Einstein-de Sitter universe lead to the form of equation (|^), with e = n + 2 and 7 = n + 3. 

What one really needs is a model able to smoothly interpolate among the two above regimes, so that it can be 
safely applied in the mildly non-linear regime, i.e. that relevant for most objects at the relevant separations. Following 
Hamilton et al. (1991), various authors have recently obtained fits of the N-body results of the type 

^{r, z) = B(neff )P[6i„(ro, z)/B{n,s)] , ro = [1 + ^{r, z)f'\ , (26) 

where 

3 f 3 /""^ 

^{r,z) = -, y\{y,z)dy^— dkkP{k,z)j^{kr) , (27) 



'Jo '^^ ' Jo 

and similarly for in terms of the linear power-spectrum, Piin{k,z) = D^{z)P\inik,0). In Hamilton et al. (1991) 
the B{ncft) factor was set to unity. For the Einstein-de Sitter case, Jain et al. (1995, hereafter JMW) gave theoretical 
arguments for _B(ncff) ~ (1 +ncft/3)'''*, with UcS the effective spectral index, which for a general, i.e. non-scale-free, 
initial power-spectrum is defined as 

dlnPlin(fc,2:) I 

noS[z) = ; , (28) 

dink \k=k„i{z) 

where k~j^ is the radius of a top-hat window function at which the rms linear mass fluctuation is unity. The JMW 
formula, which reads 
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^("^^^ 1 + 0.02X3 +0.003X^/^ > X = C,.„(ro,.)/BKff), (29) 

fits the N-body results, in tlie range of scales and redshifts where it is testable, with an accuracy better than twenty 
per cent. The JMW fit seems to perform better than the others (Hamilton et al. 1991; Peacock & Dodds 1994, 1996) 
in the highly non-linear regime, which is relevant if one needs to predict the amount of clustering at early epochs 
(e.g. Baugh & Gaztafiaga 1996). The formula (29) differs from that used by Peacock & Dodds (1996) slightly in the 
asymptotic behaviour for small separations. 

Here the linear prediction is obtained for X ^ 0, while the stable clustering behaviour is asymptotically recovered 
for X — > oo. To obtain the differential correlation function, ^(r, z), we need to differentiate the above relation with 
respect to r. We obtain 

, [l + i3(n.ff)F(X)]F'(X)Ag„„(ro,.) 
1 + B{ncs)t (X) - '(X)A5iin(ro, z) 
with F' = dF/dX and 

1 

A^Un{ro,z) = ^iin{ro,z) - ^iin{ro,z) = J dkk^Piin{k, z)[jo{kro) - {3/kro)ji{kro)] . (31) 

In Fig. 1 we show the mass autocorrelation function ^(r, z) obtained from the model above and compare it with 
the linear theory and stable clustering predictions for the same linear power-spectrum. Note that the stable clustering 
model always tends to largely overestimate the amplitude of the correlation function on quasi-linear scales. 

The qualitative effect of convolving the intrinsic object correlation function b'^g(z)£,{r, z) with the redshift distri- 
bution Af{z), as prescribed by equation (|l8|), can be that of partially diluting the signal. Quantitatively the effect can 
be estimated by using a very simple model: we assume no bias, 6=1, uniform selection function, ^(z, M) — constant, 
and constant comoving object number-density, M{z) oc {1 + zY' g{z). Integration over a typical redshift range A.z = 2, 
centered on a median redshift of 1, shows that the observed correlation function systematically underestimates the 
intrinsic one by about ten per cent. The effect can be higher in case the redshift-dependence of either the catalogue 
selection function, or of the intrinsic object number-density, preferentially selects lower redshift objects, which are 
generally less biased. 

An extension of this formalism to the open universe case has been obtained for the evolved power-spectrum 
in terms of the linear one by Peacock & Dodds (1996). The simple idea behind their extension is that if collapse 
occurs at high redshift, when Q. was very close to unity, then non-linear correlations at fixed proper separation still 
evolve as (l + z)""^ at low redshift, what changes instead is the linear growth factor which is suppressed by a known 
^-dependent factor. We shall defer discussion of clustering in open universe models to a later paper. 



4 EVOLUTION OF BIAS 

We now turn to the issue of how objects trace the mass distribution, i.e. to the question of the bias and how it might 
evolve with epoch. In the framework we are considering here this basically boils down to the behaviour of the linear 
bias parameter b, where &^ is the factor by which the two-point correlation function of a given class of objects exceeds 
the autocorrelation function of the mass fluctuations. The clustering of observable objects is more difficult to model 
satisfactorily than that of their parent haloes, being sensitive to various cosmological and astrophysical effects as well 
as to the mass - or any other intrinsic property - dependence of the selection function. There are various possibilities 
as to how such a bias might originate, and different biasing schemes are expected to introduce a bias that depends 
on redshift in different ways. In this paper we shall discuss four simple models which incorporate evolutionary effects 
in a relatively straightforward way. 

The first model we consider is the trivial one in which objects trace the mass at all epochs or, in terms of the 
bias parameter, that b = 1 for all values of z. We shall use this model as a kind of reference standard against which to 
compare the behaviour of other, more complicated biasing scenarios. For short, we shall call this the unbiased model. 

The next two models we consider are motivated by a recent discussion by Mo & White (1995) who, working within 
the Press-Schechter (1974) formalism, obtained an approximate formula (expected to be valid at large separations) for 
the linear bias relating the correlation function of dark matter haloes of mass M to that of mass-density fiuctuations. 
As we mentioned in Sec. 2, we now assume that physical parameters of the objects forming in haloes of mass M are 
tightly correlated with M, so that the general 'type' parameter we used there [M) is now taken to mean the mass of 
the parent halo. The assumption that the mass of the host halo determines the parameters of the object it contains 
may not be correct in detail, but it does allow the construction of relatively simple models that illustrate the way our 
formalism works. What emerges from these arguments is an expression for the bias parameter of dark matter haloes 
of mass M at redshift z, which formed at redshift Zf. The Mo & White (1985) formula reads 
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Chaio(r,2,M) = b^{M,z\zf)^{r,z) , (32) 
with 

b{M,z\zf) = l + ^{uj-l) (33) 

in which i/f = (5//(Tiin(z, M) and Sf = 5c£'+(2:)/-D+(2/). Here (5c is the critical linear overdensity for spherical 
collapse {5c = 1.69 in the Einstein-de Sitter case), and af^^ is the linear mass-variance at redshift z filtered in a 
sharp-edged sphere containing the mean mass Al. The expression (33) implies a minimum value for the bias factor 
b{M,z\zf) > 1 — 1/Sc (0.41 for S^o ~ 1). The most relevant feature of this formula in this context is that it predicts 
b Ri 1 at = 1, which for n{z, M) represented by the Press-Schechter mass-function, 



n[z 



{z,M) dlnM = \ - 



2 ^o(l + 



2 



^3 



TV MD+{z)aiin{0,M) 



dlncrii„(0,M) 



dlnM 



exp 



2Dl{z)aU0,M) 



dlnM , (34) 



(with go the mean mass density at 2 = 0), roughly corresponds to the peak mass M^(2/), implicitly defined by 
aiin{zf, Mi,) = Sc/V^. We are interested not in b{M, z\zf) but the bias of all haloes of mass M that exist at an epoch 
2, which we call b{M,z). This latter quantity is, in principle, obtained by integrating equation (33) over all Zf > z 
taking into account the relevant survival probability of a halo of mass M formed at Zf not being destroyed by merging 
by the epoch z. It is, however, implicit in the Press-Schechter approach that the only objects that exist at any 
epoch 2 are those which have just formed by the merging of smaller mass units (Sahni & Coles 1995, Sec. 3.4). This 
assumption may be questionable, but it is nevertheless consistent, within the Press-Schechter formalism, to argue 
that b{M,z) = fo(M, 2|2/ = z). Looked at in this way, the formula (33) leads to the result one would expect from 
the so-called peak-background split (Efstathiou et al. 1988; Cole & Kaiser 1989), and reduces to the standard form 
b ~ Sc/cr^ in the limit of high peaks (Kaiser 1984), implying a D^'^{z) (i.e. (1 + 2)^ if Slo = 1) redshift dependence 
in such a limit. 

The task of obtaining the form of b{M, 2) for objects undergoing dissipative collapse, such as galaxies, is rather 
more involved than just finding this function for their haloes. While haloes may merge rapidly and lose their identity, 
the same is not necessarily true for the galaxies they contain. For this reason. Mo, Jing & White (1996) prefer to use 
the formula (33) for pairs of galaxy clusters with separations r > 10 Mpc, assuming that a cluster can indeed 
be thought of as a single large halo, despite the fact that it contains many separate galaxies. There is therefore some 
ambiguity in how to translate the formula (33) to a specific situation of a particular class of objects. 

One way to incorporate the above formulae into a model for the observed clustering of objects is to assume that 
one can observe all haloes exceeding a certain cutoff mass Mmin at any particular redshift, i.e. (j>[z, M) = 0(A/ — A/min) 
at any 2, where O(-) is the Heaviside step function. One then has to weight the correlation function of haloes of mass 
M by the appropriate number density. We can thus model the linear bias at redshift 2 for haloes of mass M as in 
equation (^) and get the redshift evolution of the effective bias factor bcs{z) through equation (|l6|), by weighting it 
consistently with the Press-Schechter mass-function n{z,M). A plot of &cff(2) for different choices of the minimum 
cutoff mass in n{z,M) is shown in Fig. 2. The 2-dependence of b^s changes with Afmin, which also determines its 
present value, with the constant value 0.41, obtained for Afmin ^ 0. (Note that, in principle, Afmin may depend on 
the depth of the catalogue.) Depending on the mass spectrum considered, the redshift dependence of b^g can be such 
as to partially or even completely balance the evolution of the mass autocorrelation function. The behaviour of fecft 
emerging from such a model can be well-fitted by using the relation 

6eff(2) = 0.41 + [beff (2 = 0) - 0.41](1 + zf . (35) 

The resulting fitting parameters are reported in Table ^ for our choice of initial matter fiuctuation spectrum. One 
important feature of this model is that specifying a minimum mass for the objects determines their effective bias 
factor at 2 = 0. If the value required does not correspond to the empirically-determined bias parameter for any 
known class of objects then one must accept that such objects are transient and have faded sufficiently since the 
observed epoch to be missing from local redshift surveys. Alternatively, it may be that observational selection effects 
make objects visible at high redshift, but invisible at small distances, such as might be the case for extended objects 
of low surface brightness. We shall call this the transient model in the following discussion. 

An alternative scenario based on these considerations is to fix the effective bias parameter at 2 = so that it 
agrees with a known population of present-day objects using bo — bcff(O) « l/o"8' ~ 1-46 - where by ct"' we mean 
the non-linearly evolved rms mass fiuctuation - and evolving it as 

beff(2) =b_l + (bo-b_l)(l + 2)'^ , (36) 

with suitable parameters b_i = b[z —1) and 13; fixing bo effectively determines Afmin while the previous calculation 
fixed Afniin in order to determine bo. If one therefore imagines that a given population of objects at the present epoch 
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[tp] 

Table 1. The fitting parameters of the effective bias b(,ff by using equation (|35[). Column 1: the minimum cutoff mass (in 
Mq). Column 2: the value of b^g at 2: = 0. Column 3: the parameter /3. 



109 


0.50 


1.92 ±0.01 




0.55 


1.89 ±0.01 


10" 


0.67 


1.85 ±0.01 


IOI2 


0.90 


1.79 ±0.01 


IQia 


1.41 


1.76 ±0.02 


10" 


2.67 


1.78 ±0.03 



(with a 'known' value of 60) form via the merging of lower mass haloes, such as is the case in hierarchical clustering 
models, then this model would appear to be appropriate. For short we shall call this merging model in the subsequent 
discussion. 

The crucial difference between the transient model and the merging model is that the latter requires that one 
identify the population of objects one is observing at high z with the progenitors of present-day bright galaxies while 
the former does not. 

Our final alternative is motivated by different arguments. According to Nusser & Davis (1994) and Fry (1996), 
if galaxies form at some characteristic redshift 2/ by some non-linear process which induces a bias parameter at that 
epoch bf and if their subsequent motion is purely caused by gravity, then continuity leads to a redshift -dependent 
bias factor which evolves approximately as 

b{z)^l + {bf-l)^-^ , z<zf, (37) 

1 + Zf 

for Qq — 1. The obvious consequence of this simple model is that the bias factor always tends to unity as time goes 
on. Allowing for some spread in the values of 6/ and zf, depending e.g. on the mass of the galaxy, would not greatly 
change this evolution law. We call this model the object-conserving model. Equation (37) is consistent with the Mo 
& White (1995) formula, equation (33), provided that the effect of merging is negligible. 

A linear dependence on 1 + 2: as obtained in the object-conserving model would also be obtained if objects were 
assumed to form at high peaks of the linear density field above some threshold 5c — von^ threshold, with fixed 1/ in 
accordance with equation (33). This model has been recently adopted by Groom & Shanks (1996) to fit the quasar 
two-point function. 

Notice that all four of these models can be thought of as special cases of the merging model whose behaviour 
is described by equation (36). The transient model has its parameters fixed by the choice of halo mass cutoff. The 
merging model has parameters determined by the choice of bias parameter at 2 = 0. The object conserving model 
has 6-1 = 13 = 1, but otherwise free parameters. The case b = 1 for all epochs can also be accomodated by equation 
(36): a bias of unity in a merging model would occur if objects preferentially formed in haloes with mass close to A/* 
or if, for some reason, such objects were selected observationally. 

These models are not exhaustive of all the possibilities, but they do serve to illustrate the spread in possible 
behaviours expected with fairly minimal assumptions about the bias. Alternative biasing models, not discussed further 
in this paper, have been considered by, for example. Coles (1993) who discusses a general class of local bias models 
and Catelan et al. (1994), who define a weighted bias algorithm, according to which the biased density field coincides 
with the mass-density whenever the latter exceeds some fixed threshold. As mentioned in Section 2, 'generic' local 
bias models predict that & is a function of scale, a fact we shall ignore in this analysis. This introduces the possibility 
that the shape as well as the amplitude of the biasing relation changes with epoch, allowing for an even wider range 
of possibilities. We shall defer this question to future work. 



5 APPLICATIONS 

In this section we illustrate the use of our formalism by applying it to a example data sets. Given the mass autocorre- 
lation function, each class is characterized by its redshift distribution N{z), which automatically accounts for the 
catalogue selection function, and ii) the effective bias parameter 6cff(2) which we discussed in the previous Section. 
Note that our method is semi-empirical, in the following sense: the theoretical mass-function is only employed as the 
appropriate weighting factor to deduce the effective bias factor, but we never require that it predicts the observed 
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redshift distribution of objects in the catalogue, modulo the selection function; in our applications, in fact, we will 
take Af{z) as obtained from the data. 

5.1 Spatial QSO correlation function 

Quasars are a class of objects to which we may apply this formalism. Various authors have looked at the spatial 
quasar clustering and at their redshift evolution (e.g. Shanks et al. 1987; lovino & Shaver 1988; Andreani & Cristiani 
1992; Andreani et al. 1994; Shanks & Boyle 1994; Groom & Shanks 1996). 

According to Efstathiou & Rees (1988), Haehnelt (1993) and Katz et al. (1994), every halo of mass greater than 
some threshold Mmin ~ 10^^ — 10^^ Mq harbours a QSO with a lifetime tg which we can assume to be much less 
than the time it takes for the halo to lose its identity by merging. In this case QSOs existing at a particular epoch 
sparsely sample the distribution of haloes with mass greater than Mmin at that epoch. This implies that bcg{z) for 
quasars can be calculated using the transient model. 

The spatial correlation function of quasars in the redshift range 0.3 < z < 2.2 is shown in the upper panel of 
Fig. 3; the redshift distribution Af{z) for the Durham/AAT survey is taken from Shanks & Boyle (1994) while the 
observational data are the updated estimates from Groom & Shanks (1996). The effective bias bc:fi{z) is calculated 
from equation (|l^, where n{z,M) is modelled by the Press-Schechter mass-function, with cutoff mass in the range 
10^^ — 10^^ Mq. As it is clear from the plot, given the uncertainty on the minimum mass of haloes able to host 
a quasar, our GDM-like model accounts for all the observational data, with the (possible) exception of the point at 
~ 5h~^ Mpc, at which the difference is less than 2a. 

In the other panels of Fig. 3 we plot the spatial correlation function of the same QSO catalogue in two different 
redshift ranges: 0.3 < z < 1.4 (centre) and 1.4 < z < 2.2 (bottom). While the results at higher redshifts are in 
good agreement with the observational data, the points at r ~ 10 and at r > 20h~^ Mpc for 0.3 < z < 1.4 show a 
2a deviation. Notice that the predicted evolution of ^ over these ranges is rather small, even for the upper band of 
the shaded region. In fact, the data at these scales suggest stable clustering in comoving coordinates. The possible 
discrepancies between the model and the data, however, are on very small scales, where ^obs > 1. This suggests that 
a more decisive test of this model could be constructed by quantifying the clustering of QSOs on scales smaller than 
Tc. Notice that there is some indication that a scale-dependent bias is required to reconcile the observed correlation 
function with that predicted by our model, but that this is to some extent predicted by realistic biasing scenarios 
(Goles 1993). 

5.2 Angular and projected galaxy correlation functions 

While the choice of bias model seems relatively straightforward for QSOs, this is is not the case for galaxies. Various 
plausible arguments can be made for choosing any of the models we discussed in Section 4. 

The first possibility we consider is that galaxies represent at any redshift an unbiased sample of the mass 
distribution, i.e. that baft{z) = 1. According to the model of equation (^3|), for the bias of dark matter haloes this 
situation would occur if galaxies were preferentially formed from haloes with mass close to Af*(z/) and survived until 
the epoch z. 

Another possibility would be that faint galaxies are subunits that merge to make up more luminous galaxies 
(Broadhurst, Ellis & Glazebrok 1992; Glements & Gouch 1996; Baugh, Gole & Frenk 1996), in which case, using the 
redshift dependence of feoff (z) obtained in the previous section for dark matter haloes, looks a sensible choice. This 
model too fits into equation (36), for fe_i — 0.41, bo — l/crg' — 1-46 and /3 « 1.8. 

One might also assume that faint galaxies form a transient population of dwarf galaxies which has now faded 
away (Babul & Rees 1992; Lacey et al. 1993). This would motivate the transient model which, as we have noted, can 
also be described by equation (36) . For illustrative purposes we incorporate a minimum mass of Mmin = lO"/l"^M0, 
with consequent values of the parameters given in Table 1. 

If galaxies form at some particular redshift and then evolve without losing their identity and thus simply following 
the continuity equation thereafter, the object-conserving model seems to be appropriate. In this picture, faint galaxies 
of a particular type would be identified with 'ordinary' luminous galaxies at high redshift (e.g. Tyson 1988). It is 
natural to apply this scenario to, for example, present-day bright spirals which appear to have survived undisturbed 
for some time by interactions or mergers with objects of a similar mass: the corresponding value of bo — 1/fg' ~ 1.46. 

Given the plausibility of each of these models, we now compare all of them with the best currently available data 
on the evolution of galaxy clustering with redshift using the model described in Section 2 to evolve the underlying 
matter correlations. 

In the top panel of Fig. 4 we plot the angular correlation of galaxies out to z = 1.6 for the Ganada-France 
Redshift Survey (GFRS). The galaxy redshift distribution is taken from Grampton et al. (1995). The observational 
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data have been obtained by Hudon & Lilly (1996) and refer to the same catalogue. These data are uncorrected for 
the integral constraint, spurious field to field variations or for stellar contamination. Two different methods were used 
by Hudon & Lilly to estimate uj{9) from the observational data: in the first the angular correlation is obtained by 
averaging the different estimates in each of the 24 fields composing the catalogue (local method, open circles); in the 
second the counting of pairs is done over all cells in the sample (global method, filled squares). The true unbiased 
angular correlation is bracketed by the two methods. 

The bottom panel of the same figure shows our prediction for a sample of 222 galaxies with z < 1.6 selected 
from the Hawaii Keck K-band survey (Cowie et al. 1996). The redshift distribution J\f{z) and the correlation data are 
taken from Carlberg et al. (1996). In order to take into account the dilution produced by the uncorrolated foreground 
stars, we correct the original data by a factor of (1 — /*)~^, where /* is the fraction of catalogued objects that are 
stars. Carlberg et al. (1996) estimate for their catalogue /« = 1/4. 

Note that the CFRS data are well described by both the unbiased and transient model while the Keck data, with 
their larger errors, are compatible with the galaxy-conserving model and, marginally, with the merging model. 

In Fig. 5 we present our prediction for galaxies in the Hubble Deep Field (HDF). Villumsen, Freudling & da 
Costa (1996) fit the redshift distribution by 

J\f{z) = 2.723^ exp[-(z/^o)^-^] , (38) 

^0 

where zo is approximately the median redshift. Alternative estimates of the redshift distribution for these galaxies 

are presented by Lanzotta, Yahil & Fernandez -Soto (1996), but for illustrative purposes wc adopt the former results 
here. We consider four samples with magnitude limits R = 26, 27, 28, 29, with corresponding median redshifts of 
Zo = 1.35, 1.54, 1.71, 1.87, respectively. Observational estimates of the correlation functions for these samples have 
also been obtained by Villumsen, Freudling & da Costa (1996). After correcting the results to include the integral 
constraint, they fitted their angular correlations by assuming a power law 

uj{e) = Ae-^ (39) 

with fixed slope 5 = 0.8. The shaded region in the plots refers to the la range allowed by their fits. 

Again the unbiased model appears to provide the best fit over all the the samples selected: the other biasing 
models yield too high an amplitude for the deeper samples. There is also some indication that the actual correlation 
function of galaxies in this catalogue is steeper than the model predicts, though this may in principle be accounted 
for by allowing the bias factor to depend on scale in the expected fashion (Coles 1993). 

In Fig. 6 wc plot the projected correlation function for galaxies with CFRS redshift distribution and compare 
it with observational data from the same catalogue (Le Fevre et al. 1996). The data are divided in three ranges in 
redshift: 0.2 < z < 0.5, 0.5 < z < 0.75 and 0.75 < z < 1. Note that the observational data, originally plotted as a 
function of the proper projected separation at the median redshift of three considered strips {z — 0.34, z — 0.62 and 
z = 0.86 respectively), have been rescaled to the comoving, projected separation Vp. Again the merging model and 
the number-conserving model overpredict the correlations, and the overprediction is worse for the samples at higher 
2. The unbiased and transient models are, however, good fits to the data. 

Fig. 7 shows our prediction for galaxies for the Keck sample, with redshift distribution and correlation data from 
Carlberg et al. (1996). Two different strips are here considered: 0.3 < z < 0.9 and 0.9 < z < 1.5. Observational data 
are rescaled as before at the median redshift, z = 0.6 and z = 1.1 respectively. Again the merging model seems to be 
excluded by the high redshift data; the other three alternatives are consistent. Again, there is some evidence that, at 
high redshift, the shape of the galaxy correlation function is wrong, possibly because of scale-dependence of b. 



6 CONCLUSIONS 

The principal result of this analysis is that uncertainties in the evolution of the bias parameter with epoch z are 

potentially the dominant consideration to be taken into account when testing theoretical models against galaxy 
clustering evolution data. Looking at this in another way, one can say that the rough consistency of observed clustering 
data at high redshift with unbiased models can place strong constraints on the bias invoked in any particular theory of 
structure formation. One has to be very careful about postulating a significant bias at z — without thinking carefully 
about how the bias was introduced, as plausible scenarios give a bias parameter that evolves strongly with cosmic 
epoch in contrast with the observations. Since all the bias models we consider can be described by the same formula 
(36), this basically argues that, at least within the class of models considered, the bias must be small at the present 
epoch so as not to increase too drastically with redshift. Notice further that any extra correlations introduced by 
the effects of magnification bias induced by gravitational lensing actually strengthen this conclusion, as the observed 
clustering must include both the intrinsic clustering of the objects and the contribution from correlated shear. 
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In the case of QSOs, we find that a simple model in which quasars trace the distribution of haloes is roughly 
consistent with the clustering data supplied by Groom & Shanks (1996) for all epochs. This conclusions contrasts 
with the claim of Groom & Shanks that the apparently low level of clustering evolution requires either a strong bias 
or a low density universe (or both). The important points of disagreement with Groom & Shanks (1996) are that (i) 
their conclusions are based on a standard GDM model (i.e. Feff = 0.5) for the initial power-spectrum, (ii) they use 
linear theory to evolve the matter fluctuations and {Hi) they adopt a model for the evolution of bias which resembles 
our object- conserving model. 

It is pertinent to compare our analysis of galaxy clustering data with a recent paper by Peacock (1996). By 
comparing estimates of the power-spectrum of clustering at z — and the GFRS data on redshift evolution with 
the expected non-linear evolution of a phenomenological model for the power-spectrum he concludes that only an 
unbiased model for galaxy correlations in a low density universe is compatible with the observations. We have not 
studied low-density models in this paper, so we do not comment on whether such models offer a better explanation 
of the power-spectrum. There are, however, some important points of difference between our analysis of Einstein-de 
Sitter models and that of Peacock (1996). First, the model for the initial power-spectrum adopted by Peacock is 
of a slighty different form to ours: he uses a model of two power laws with a transition region in between. Since 
our model is not exactly a power law on small scales there will be some differences in the consequent clustering 
behaviour starting from these two different initial assumptions. These differences might well alter the behaviour of 
the correlation function on the relatively small scales where its amplitude is known accurately. Secondly, Peacock 
(1996) uses a different fitting formula for the non-linear evolution of correlations, based on that of Peacock & Dodds 
(1996). Third, Peacock (1996) does not consider models for the evolution of b, preferring instead to assume that the 
bias factor has the same value at all redshifts. Finally, we do not include clustering data obtained from catalogues 
at zero redshift in our analysis because different surveys may either involve physically different objects or possess 
different observational selection functions (or both). 

These differences are important when we compare our main conclusion (that unbiased or mildly biased clustering 
developing from our initial spectrum in a universe with Oq = 1 is consistent with the clustering data) with Peacock's 
conclusion that he cannot reproduce the observed clustering with his model in a universe with Qo = 1 unless he 
invokes a particular form of biasing to counteract the small-scale evolution of the matter power-spectrum. Since the 
differences in our approaches manifest themselves most strongly on small scales, these are probably the origin of our 
disagreement on this point. In any case, we do not attempt to fit the detailed shape of the IRAS/APM power-spectrum 
(for reasons mentioned above) and this is his main argument that a specific form of the bias is necessary if Slo = 1. 
The GFRS and Keck data, in themselves, are well explained by our model. Nevertheless, it is obviously important 
to understand the origin of present-day bright galaxies such as those constituting the APM survey. It emerges from 
our analysis that there is a problem explaining the high value of bo ~ 1.4 required by our GDM-like model for the 
these galaxies. According to the models we have presented, a population of present-day objects with 6o > 1 should 
be identifiable with some class of high-redshift objects with an even higher value of b, unless the present-day objects 
are contained within haloes having a minimum mass of order 10^'^A/q (see Fig. 2) and which is very different from 
that corresponding to any observable high-redshift objects. An alternative solution to this conundrum that does not 
require a 'special' population of objects to be identified with present-day galaxies is to invoke f2o < 1, because then 
there is no need for bright galaxies at the present epoch to have bo > 1. A model with flo < 1 and in which bright 
galaxies trace the mass is probably consistent with both the high-redshift data and local measures of clustering, as 
argued by Peacock (1996). 

The discrepancies between our conclusions for both QSOs and galaxies and the studies of Groom & Shanks (1996) 
and Peacock (1996) respectively, serve as an important reminder of the model-dependence of this type of analysis and 
of some of the residual theoretical uncertainties. In particular, it is important to understand the small-scale stable 
clustering limit in more detail if this technology is to be improved. Above all, however, we need to develop much 
better defined models for the form and evolution of biasing in a general context, for even in the simplest case in which 
the bias is a constant linear multiplier there is considerable ambiguity in the modelling of its evolution. 
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FIGURE CAPTIONS 

Figure 1. The mass autocorrelation function ^ as a function of tlie comoving separation r, at different redshifts: 
z = 3 (top left), z = 2 (top right), z = 1 (bottom left) and z — (bottom right). Different lines refer to different 
models for the clustering evolution: JMW fitting formula (solid line), linear theory (dotted line) and stable clustering 
(dashed line). 

Figure 2. The effective bias bctt as a function of the redshift z. Different lines refer to different values of the minimum 
mass in the Press-Schechter mass-function, ranging from 10^ to 10^* Mq, from bottom to top. 
Figure 3. Theoretical prediction for the spatial correlation function of quasars, with Af{z) given by the Durham/AAT 
redshift distribution (Shanks & Boyle 1994). Different panels refer to different redshift ranges: whole range 0.3 < z < 
2.2 (top), 0.3 < z < 1.4 (centre) and 1.4 < z < 2.2 (bottom). The shaded region shows the values obtained if the 
effective bias is obtained with a minimum mass in the band Mmin ~ 10^^ (lower line) and 10^^ Mq (upper line). 
The open squares and relative error bars refer to the Groom & Shanks (1996) results for the observed QSO two-point 
function in the same sample. 

Figure 4. Theoretical prediction for the angular galaxy correlation function. Top panel: galaxies with z < 1.6 and 
Af{z) given by the CFRS redshift distribution (Crampton et al. 1995). Correlation data are from Hudon & Lilly (1996) 
and are obtained by using two different methods which bracket the true values: the local and global determinations 
are shown by open circles and filled squares, respectively. Bottom panel: galaxies in the redshift range < z < 1.6 
selected from the Hawaii Keck K-band survey, with redshift distribution and correlation data from Carlberg et al. 
(1996). The original data are corrected to take into account the dilution produced by the uncorrelated foreground 
stars. Different bias models are considered: unbiased model [b{z) = 1; solid line]; galaxy-conserving model [equation 
( ^6| ) with bo = 1.46, — (3 — 1; dotted line]; merging model equation ( ^^ with bo — 1.46, b_i = 0.41 and f3 — 1.8; 
short dashed line]; transient model [equation ( p6[ ) with bo — 0.67, 6_i = 0.41, /3 — 1.85; long dashed line]. The latter 
model almost coincides with the unbiased case. 

Figure 5. Theoretical prediction for the angular galaxy correlation function for the Ifubble Deep Field. The shaded 
region in the plots refers to the Icr range allowed by the fit obtained on the observational data by Villumsen, Freudling 
& da Costa (1996). The panels show the results for four R magnitude limited samples with different median redshift 
zo: R <26 and zq = 1.35 (top left), R < 27 and zq = 1.54 (top right), i? < 28 and zo = 1.71 (down left) and _R < 29 
and Zo — 1.87 (down right). Different bias models are shown as in Fig. 4. 

Figure 6. Theoretical prediction for the projected galaxy correlation function of the CFRS sample. The redshift 
distribution is given by Crampton et al. (1995). Correlation data are from Le Fevre et al. (1996). Different panels 
refer to different strips in redshift: 0.2 < z < 0.5 (top), 0.5 < z < 0.75 (centre) and 0.75 < 2 < 1 (bottom). Different 
bias models are shown as in Fig. 4. 

Figure 7. Theoretical prediction for the projected galaxy correlation function of the ffawaii Keck K-band survey. 
The redshift distribution and correlation data are from Carlberg et al. (1996). Different panels refer to different strips 
in redshift: 0.3 < z < 0.9 (top) and 0.9 < z < 1.5 (bottom). Different bias models are shown as in Fig. 4; the transient 
model almost coincides with the 6=1 (unbiased) model in the upper panel. 
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